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Abstract 



This document is intended as a study of benchmark cross sections at the LHC 
(at 7 TeV) at NLO using modem PDFs currently available from the 6 PDF 
fitting groups that have participated in this exercise. It also contains a succinct 
user guide to the computation of PDFs, uncertainties and correlations using 
available PDF sets. 

A companion note provides an interim summary of the current recommenda- 
tions of the PDF4LHC working group for the use of parton distribution func- 
tions (PDFs) and of PDF uncertainties at the LHC, for cross section and cross 
section uncertainty calculations. 
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1. Introduction 

The LHC experiments are currently producing cross sections from the 7 TeV data, and thus need accurate 
predictions for these cross sections and their uncertainties at NLO and NNLO. Crucial to the predictions 
and their uncertainties are the parton distribution functions (PDFs) obtained from global fits to data from 
deep-inelastic scattering, Drell-Yan and jet data. A number of groups have produced publicly available 
PDFs using different data sets and analysis frameworks. It is one of the charges of the PDF4LHC working 
group to evaluate and understand differences among the PDF sets to be used at the LHC, and to provide 
a protocol for both experimentalists and theorists to use the PDF sets to calculate central cross sections 
at the LHC, as well as to estimate their PDF uncertainty. This current note is intended to be an interim 
summary of our level of understanding of NLO predictions as the first LHC cross sections at 7 TeV are 
being produced 0. The intention is to modify this note as improvements in data/understanding warrant. 

For the purpose of increasing our quantitative understanding of the similarities and differences 
between available PDF determinations, a benchmarking exercise between the different groups was per- 
formed. This exercise was very instructive in understanding many differences in the PDF analyses: 
different input data, different methodologies and criteria for determining uncertainties, different ways 
of parametrizing PDFs, different number of parametrized PDFs, different treatments of heavy quarks, 
different perturbative orders, different ways of treating (as an input or as a fit parameter), different 
values of physical parameters such as itself and heavy quark masses, and more. This exercise was 
also very instructive in understanding where the PDFs agree and where they disagree: it established a 
broad agreement of PDFs (and uncertainties) obtained from data sets of comparable size and it singled 
out relevant instances of disagreement and of dependence of the results on assumptions or methodology. 

The outline of this interim report is as follows. The first three sections are devoted to a description 
of current PDF sets and their usage. In Sect. IZjwe present several modem PDF determinations, with 
special regard to the way PDF uncertainties are determined. First we summarize the main features 
of various sets, then we provide an explicit users' guide for the computation of PDF uncertainties. In 
Sect.[33we discuss theoretical uncertainties on PDFs. We first introduce various theoretical uncertainties, 
then we focus on the uncertainty related to the strong coupling and also in this case we give both a 
presentation of choices made by different groups and a users' guide for the computation of combined 
PDF-i-Os uncertainties. Finally in Sect.|4]]we discuss PDF correlations and the way they can be computed. 

In Sect. [S^we introduce the settings for the PDF4LHC benchmarks on LHC observables, present 
the results from the different groups and compare their predictions for important LHC observables at 7 
TeV at NLO. In Sect. [6]] we conclude and briefly discuss prospects for future developments. 



'Comparisons at NNLO for W,Z and Higgs production can be found in ref. |T| 
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2. PDF determinations - experimental uncertainties 

Experimental uncertainties of PDFs determined in global fits (usually called "PDF uncertainties" for 
short) reflect three aspects of the analysis, and differ because of different choices made in each of these 
aspects: (1) the choice of data set; (2) the type of uncertainty estimator used which is used to deter- 
mine the uncertainties and which also determines the way in which PDFs are delivered to the user; (3) 
the form and size of parton parametrization. First, we briefly discuss the available options for each 
of these aspects (at least, those which have been explored by the various groups discussed here) and 
summarize the choices made by each group; then, we provide a concise user guide for the determina- 
tion of PDF uncertainties for available fits. We will in particular discuss the following PDF sets (when 
several releases are available the most recent published ones are given in parenthesis in each case): 
AB KM/ABM 1113, CTEQ/CT (CTEQ6.6 [4|, CTIO [5|), GJR HERAPDF (HERAPDFl.O [8]), 

MSTW (MSTW08 f9\), NNPDF (NNPDF2.0 [lOJ). There is a significant time-lag between the develop- 
ment of a new PDF and the wide adoption of its use by experimental collaborations, so in some cases, 
we report not on the most up-to-date PDF from a particular group, but instead on the most widely-used. 

2.1 Features, tradeoiTs and choices 

2.11 Data Set 

There is a clear tradeoff between the size and the consistency of a data set: a wider data set contains 
more information, but data coming from different experiment may be inconsistent to some extent. The 
choices made by the various groups are the following: 

• The CTEQ, MSTW and NNPDF data sets considered here include both electroproduction and 
hadroproduction data, in each case both from fixed-taiget and collider experiments. The electro- 
production data include electron, muon and neutrino deep-inelastic scattering data (both inclusive 
and charm production). The hadroproduction data include Drell-Yan (fixed target virtual photon 
and colUder W and Z production) and jet production H. 

• The GJR data set includes electroproduction data from fixed-target and collider experiments, and 
a smaller set of hadroproduction data. The electroproduction data include electron and muon 
inclusive deep-inelastic scattering data, and deep-inelastic charm production from charged leptons 
and neutrinos. The hadroproduction data includes fixed-target virtual photon Drell-Yan production 
and Tevatron jet production. 

• The AB KM/ABM data sets include electroproduction from fixed-target and collider experiments, 
and fixed-target hadroproduction data. The electroproduction data include electron, muon and 
neutrino deep-inelastic scattering data (both inclusive and charm production). The hadropro- 
duction data include fixed-target virtual photon Drell-Yan production. The most recent version, 
ABM 10 fTTl, includes Tevatron jet data. 

• The HERAPDF data set includes all HERA deep-inelastic inclusive data. 

2.12 Statistical treatment 

Available PDF determinations fall in two broad categories: those based on a Hessian approach and those 
which use a Monte Carlo approach. The delivery of PDFs is different in each case and will be discussed 
in Sect.O 

Within the Hessian method, PDFs are determined by minimizing a suitable log-likelihood func- 
tion. Different groups may use somewhat different definitions of x^, for example, by including entirely, 

^Although the comparisons included in tiiis note are only at NLO,we note that, to date, the inclusive jet cross section, unlike 
the other processes in the list above, has been calculated only to NLO, and not to NNLO. This may have an impact on the 
precision of NNLO global PDF fits that include inclusive jet data. 
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or only partially, correlated systematic uncertainties. While some groups account for correlated uncer- 
tainties by means of a covariance matrix, other groups treat some correlated systematics (specifically but 
not exclusively normalization uncertanties) as a shift of data, with a penalty term proportional to some 
power of the shift parameter added to the x^- The reader is referred to the original papers for the precise 
definition adopted by each group, but it should be born in mind that because of all these differences, 
values of the quoted by different groups are in general only roughly comparable. 

With the covariance matrix approach, we can define = J^iji^^i ~ di)covij{dj — dj), di 
are data, di theoretical predictions, A'^dat is the number of data points (note the inclusion of the factor 
in the definition) and covjj is the covariance matrix. Different groups may use somewhat different 
definitions of the covariance matrix, by including entirely or only partially correlated uncertainties. The 
best fit is the point in parameter space at which is minimum, while PDF uncertainties are found 
by diagonalizing the (Hessian) matrix of second derivatives of the at the minimum (see Fig. [U and 
then determining the range of each orthonormal Hessian eigenvector which corresponds to a prescribed 
increase of the function with respect to the minimum. 

In principle, the variation of the which corresponds to a 68% confidence (one sigma) is Ax^ = 
1. However, a larger variation Ax^ = T^, with T > 1 a suitable "tolerance" parameter 1 12l [T3l [T4l 
may turn out to be necessary for more realistic error estimates for fits containing a wide variety of in- 
put processes/data, and in particular in order for each individual experiment which enters the global fit 
to be consistent with the global best fit to one sigma (or some other desired confidence level such as 
90%). Possible reasons why this is necessary could be related to data inconsistencies or incompatibili- 
ties, underestimated experimental systematics, insufficiently flexible parton parametrizations, theoretical 
uncertainties or approximation in the PDF extraction. At present, HERAPDF and ABKM use Ax^ = 1, 
GJR uses T w 4.7 at one sigma (corresponding to T w 7.5 at 90% c.L), CTEQ6.6 uses T = 10 at 
90% c.l. (corresponding to T 6.1 to one sigma) and MSTW08 uses a dynamical tolerance f9l, i.e. a 
different value of T for each eigenvector, with values for one sigma ranging from Tss ltorRi6.5 and 
most values being 2 < T < 5. 

Within the NNPDF method, PDFs are determined by first producing a Monte Carlo sample of 
A'^rcp pseudo-data replicas. Each replica contains a number of points equal to the number of original data 
points. The sample is constructed in such a way that, in the limit A'^rop oo, the central value of the 
i-th data point is equal to the mean over the A^rep values that the i-th point takes in each replica, the 
uncertainty of the same point is equal to the variance over the replicas, and the correlations between any 
two original data points is equal to their covariance over the replicas. From each data replica, a PDF 
replica is constructed by minimizing a x^ function. PDF central values, uncertainties and correlations 
are then computed by taking means, variances and covariances over this replica sample. NNPDF uses 
a Monte Carlo method, with each PDF replica obtained as the minimum x^ which satisfies a cross- 
validation criterion |[T5l [TOl . and is thus larger than the absolute minimum of the x^- This method has 
been used in all NNPDF sets from NNPDF 1.0 onwards. 

2.13 Parton parametrization 

Existing parton parametrizations differ in the number of PDFs which are independently parametrized 
and in the functional form and number of independent parameters used. They also differ in the choice of 
individual linear combinations of PDFs which are parametrized. In what concerns the functional form, 
the most common choice is that each PDF at some reference scale Qo is parametrized as 

fi{xM=Nx^^il-xf^gi{x) (1) 

where gi{x) is a function which tends to a constant both for x — )• 1 and x — )• 0, such as for instance 
gi{x) = 1 + eiy/x + DiX + Eix'^ (HERAPDF). The fit parameters are ai, Pi and the parameters in gi. 
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Some of these parameters may be chosen to take a fixed value (including zero). The general form Eq. ([T]l 
is adopted in all PDF sets which we discuss here except NNPDF, which instead lets 



fi{x,Qo) = Ci{x)NNi{x) 



(2) 



where NNi{x) is a neural network, and Ci{x) is is a "preprocessing" function. The fit parameters are the 
parameters which determine the shape of the neural network (a 2-5-3-1 feed-forward neural network for 
NNPDF2.0). The preprocessing function is not fitted, but rather chosen randomly in a space of functions 
of the general form Eq. (|2]) within some acceptable range of the parameters aj and f3i, and with gi = 1. 

The basis functions and number of parameters are the following. 

• ABKM parametrizes the two lightest flavours and antiflavours, the total strangeness and the gluon 
(five independent PDFs) with 21 free parameters. 

• CTEQ6.6 and CTIO parametrize the two lightest flavours and antiflavours the total strangeness and 
the gluon (six independent PDFs) with respectively 22 and 26 free parameters. 

• GJR parametrizes the two lightest flavours and antiflavours and the gluon with 20 free parameters 
(five independent PDFs); the strange distribution is assumed to be either proportional to the light 
sea or to vanish at a low scale Qo < I GeV at which PDFs become valence-like. 

• HERAPDF parametrizes the two lightest flavours, u, the combination d + s and the gluon with 10 
free parameters (six independent PDFs), strangeness is assumed to be proportional to the d distri- 
bution; HERAPDF also studies the effect of varying the form of the parametrization and of and 
varying the relative size of the strange component and thus determine a model and parametrization 
uncertainty (see Sect J3.23l for more details). 

• MSTW parametrizes the three lightest flavours and antiflavours and the gluon with 28 free param- 
eters (seven independent PDFs) to find the best fit, but 8 are held fixed in determining uncertainty 
eigenvectors. 

• NNPDF parametrizes the three lightest flavours and antiflavours and the gluon with 259 free pa- 
rameters (37 for each of the seven independent PDFs). 

2.2 PDF delivery and usage 

The way uncertainties should be determined for a given PDF set depends on whether it is a Monte Carlo 
set (NNPDF) or a Hessian set (all other sets). We now describe the procedure to be followed in each 
case. 

2.21 Computation of Hessian PDF uncertainties 

For Hessian PDF sets, both a central set and error sets are given. The number of eigenvectors is equal 
to the number of free parameters. Thus, the number of error PDFs is equal to twice that. Each eiTor 
set corresponds to moving by the specified confidence level (one sigma or 90% c.l.) in the positive or 
negative direction of each independent orthonormal Hessian eigenvector. 

Consider a variable X; its value using the central PDF for an error set is given by Xq. X^ is the 
value of that variable using the PDF corresponding to the "+" direction for the eigenvector i, and X^ 
the value for the variable using the PDF corresponding to the "— " direction. 



max 




AX. 



max 




o-xr,o)F 



(3) 
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2-dim rendition ofd-dim (-16) PDF parameter space 




contours of constant X^gi^i^^i 
Uf. eigenvector in the l-direction 
p(i)'. point of largest with tolerance T 



t(i) Sq-. global minimum 



• Hessian eigenvector basis sets 




resettling by 
the iterative method 




[a) 

Original parameter basis 



(h) 

Orthonormal eigenvector basis 



Fig. 1: A schematic representation of the transformation from the PDF parameter basis to the orthonormal eigenvector ba- 
sis (ni. 



AX+ adds in quadrature the PDF error contributions that lead to an increase in the observable X, 
and AX^ the PDF error contributions that lead to a decrease. The addition in quadrature is justified by 
the eigenvectors forming an orthonormal basis. The sum is over all N eigenvector directions. Ordinarily, 
one of — Xq and X^ — Xq will be positive and one will be negative, and thus it is trivial as to which 
term is to be included in each quadratic sum. For the higher number (less well-determined) eigenvectors, 
however, the "+" and "—"eigenvector contributions may be in the same direction. In this case, only the 
more positive term will be included in the calculation of AX+ and the more negative in the calculation 
of AX~ [24]. Thus, there may be less than N non-zero terms for either the "+" or "— " directions. A 
symmetric version of this is also used by many groups, given by the equation below: 



In most cases, the symmetric and asymmetric forms give very similar results. The extent to which 
the symmetric and asymmetric errors do not agree is an indication of the deviation of the distribution 
from a quadratic form. The lower number eigenvectors, corresponding to the best known directions in 
eigenvector space, tend to have very symmetric errors, while the higher number eigenvectors can have 
asymmetric errors. The uncertainty for a particular observable then will (will not) tend to have a quadratic 
form if it is most sensitive to lower number (higher number) eigenvectors. Deviations from a quadratic 
form are expected to be greater for larger excursions, i.e. for 90%c.l. limits than for 68% c.l. limits. 

The HERAPDF analysis also works with the Hessian matrix, defining experimental error PDFs in 
an orthonormal basis as described above. The symmetric formula Eq.|4]is most often used to calculate the 
experimental error bands on any variable, but it is possible to use the asymmetric formula as for MSTW 
and CTEQ. (For HERAPDFl.O these errors are provided at 68% c.l. in the LHAPDF file: HERAPDFIO 
EIG.LHgrid). 

Other methods of calculating the PDF uncertainties independent of the Hessian method, such as 
the Lagrange Multiplier approach |[T2l . are not discussed here. 

2.22 Computation of Monte Carlo PDF uncertainties 

For the NNPDF Monte Carlo set, a Monte Carlo sample of PDFs is given. The expectation value of any 
observable ^[{g}] (for example a cross-section) which depends on the PDFs is computed as an average 



AX 




(4) 
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over the ensemble of PDF replicas, using the following master formula: 

^^f<=p fc=i 

where A'rep is the number of replicas of PDFs in the Monte Carlo ensemble. The associated uncertainty 
is found as the standard deviation of the sample, according to the usual formula 




These formulae may also be used for the determination of central values and uncertainties of the parton 
distribution themselves, in which case the functional T is identified with the parton distribution q : 
= q. Indeed, the central value for PDFs themselves is given by 



^Vrep f^^^ 

NNPDF provides both sets of A^rcp = 100 and A'rep = 1000 replicas. The larger set ensures 
that statistical fluctuations are suppressed so that even oddly-shaped probability distributions such as 
non-gaussian or asymmetric ones are well reproduced, and more detailed features of the probability dis- 
tributions such as correlation coefficients or uncertainties on uncertainties can be determined accurately. 
However, for most common applications such as the determination of the uncertainty on a cross section 
the smaller replica set is adequate, and in fact central values can be determined accurately using a yet 
smaller number of PDFs (typically A^rcp ^ 10), with the full set of A'rep ~ 100 only needed for the 
reliable determination of uncertainties. 

NNPDF also provides a set in the NNPDF20_100.LHgrid LHAPDF file, as in previous releases 
of the NNPDF family, while replicas 1 to 100 correspond to PDF sets 1 to 100 in the same file. This set 
contains the average of the PDFs, determined using Eq. (IT]): in other words, set contains the central 
NNPDF prediction for each PDF. This central prediction can be used to get a quick evaluation of a 
central value. However, it should be noticed that for any which depends nonlinearly on the PDFs, 

/ This means that a cross section evaluated from the central set is not exactly equal 

to the central cross section (though it will be for example for deep-inelastic structure functions, which 
are linear in the PDFs). Hence, use of the set is not recommended for precision applications, though 
in most cases it will provide a good approximation. Note that set q^^^ should not be included when 
computing an average with Eq. (O, because it is itself already an average. 

Equation Q provides the 1-sigma PDF uncertainty on a general quantity which depends on PDFs. 
However, an important advantage of the Monte Carlo method is that one does not have to rely on a 
Gaussian assumption or on linear error propagation. As a consequence, one may determine directly a 
confidence level: e.g. a 68% c.l. for is simply found by computing the N^-cp values of T and 

discarding the upper and lower 16% values. In a general non-gaussian case this 68% c.l. might be 
asymmetric and not equal to the variance (one-sigma uncertainty). For the observables of the present 
benchmark study the 1-sigma and 68% c.l. PDF uncertainties turn out to be very similar and thus only 
the former are given, but this is not necessarily the case in in general. For example, the one sigma eiTor 
band on the NNPDF2.0 large x gluon and the small x strangeness is much larger than the corresponding 
68% CL band, suggesting non-gaussian behavior of the probability distribution in these regions, in which 
PDFs are being extrapolated beyond the data region. 
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3. PDF determinations - Theoretical uncertainties 

Theoretical uncertainties of PDFs determined in global fits reflect the approximations in the theory which 
is used in order to relate PDFs to measurable quantities. The study of theoretical PDF uncertainties is 
currently less advanced that that of experimental uncertainties, and only some theoretical uncertainties 
have been explored. One might expect that the main theoretical uncertainties in PDF determination 
should be related to the treatment of the strong interaction: in particular to the values of the QCD 
parameters, specifically the value of the strong coupling as and of the quark masses rric and mf, and 
uncertainties related to the truncation of the perturbative expansion (commonly estimated through the 
variation of renormalization and factorization scales). Further uncertainties are related to the treatment 
of heavy quark thresholds, which are handled in various ways by different groups (fixed flavour number 
vs. variable flavour number schemes, and in the latter case different implementations of the variable 
flavour number scheme), and to further approximations such as the use of K-factor approximations. Fi- 
nally, more uncertainties may be related to weak interaction parameters (such as the W mass) and to the 
treatment of electroweak effects (such as QED PDF evolution |[T6l ). 

Of these uncertainties, the only one which has been explored systematically by the majority of 
the PDF groups is the uncertainty. The way ag uncertainty can be determined using CTEQ, HER- 
APDF, MSTW, and NNPDF will be discussed in detail below. HERAPDF also provides model and 
parametrization uncertainties which include the effect of varying and nic, as well as the effect of 
varying the parton parametrization, as will also be discussed below. Sets with varying quark masses and 
their implications have recently been made available by MSTW 1 17], the effects of varying nic and rub 
have been included by ABKM |2| and preliminary studies of the effect of and rric have also been 
presented by NNPDF ifTSl . Uncertainties related to factorization and renormalization scale variation and 
to electroweak effects are so far not available. For the benchmarking exercise of Sec. [52 results are given 
adopting common values of electroweak parameters, and at least one common value of ag (though values 
for other values of Og are also given), but no attempt has yet been made to benchmark the other aspects 
mentioned above. 

3.1 The value of ag and its uncertainty 

We thus turn to the only theoretical uncertainty which has been studied systematically so far, namely 
the uncertainty on ag. The choice of value of ag is clearly important because it is strongly correlated to 
PDFs, especially the gluon distribution (the correlation of ag with the gluon distribution using CTEQ, 
MSTW and NNPDF PDFs is studied in detail in Ref. HH). See also Ref. Q for a discussion of this 
correlation in the ABKM PDFs. There are two separate issues related to the value of ag in PDF fits: first, 
the choice of as{mz) for which PDFs are made available, and second the choice of the preferred value 
of ag to be used when giving PDFs and their uncertainties. The two issues are related but independent, 
and for each of the two issue two different basic philosophies may be adopted. 

Concerning the range of available values of Og : 

• PDFs fits are performed for a number of different values of ag. Though a PDF set corresponding to 
some reference value of Ug is given, the user is free to choose any of the given sets. This approach 
is adopted by CTEQ (0.118), HERAPDF (0.1176), MSTW (0.120) and NNPDF (0.119), where 
we have denoted in parenthesis the reference (NLO) value of ag for each set. 

• ag{mz) is treated as a fit parameters and PDFs are given only for the best-fit value. This approach 
is adopted by ABKM (0. 1 179) and GJR (0.1 145), where in parenthesis the best-fit (NLO) value of 
ag is given. 

Concerning the preferred central value and the treatment of the ag uncertainty: 

• The value of as{mz) is taken as an external parameter, along with other parameters of the fit such 
as heavy quark masses or electroweak parameter. This approach is adopted by CTEQ, HERA- 
PDFl.O and NNPDF. In this case, there is no apriori central value of as{mz) and the uncertainty 
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NLO KgC^z) values used by different PDF groups 
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Fig. 2: Values of as{mz) for which fits are available. The default values and uncertainties used by each group are also shown. 
Plot by G. Watt|27J. 



on as{mz) is treated by repeating the PDF determination as is varied in a suitable range. 
Though a range of variation is usually chosen by the groups, any other range may be chosen by 
the user. 

• The value of as{mz) is treated as a fit parameter, and it is determined along with the PDFs. This 
approach is adopted by MSTW, ABKM and GJR08. In the last two cases, the uncertainty on is 
part of the Hessian matrix of the fit. The MSTW approach is explained below. 

As a cross-check,CTEQ ll20l has also used the world average value of as{mz) as an additional input to 
the global fit. 

The values of as{mz) for which fits are available, as well as the default values and uncertainties 
used by each group are summarized in Fig. [HEl The most recent world average value of as{mz) is 
as = 0.1184 lb 0.0007 ll22l 0. However, a more conservative estimate of the uncertainty on ag was 
felt to be appropriate for the benchmarking exercise summarized in this note, for which we have taken 
Aos = ±0.002 at 90%c.l. (corresponding to 0.0012 at one sigma). This uncertainty has been used for 
the CTEQ, NNPDF and HERAPDF studies. For MSTW, ABKM and GJR the preferred as uncertainty 
for each group is used, though for MSTW in particular this is close to 0.0012 at one sigma. It may not 
be unreasonable to argue that a yet larger uncertainty may be appropriate. 

When comparing results obtained using different PDF sets it should be borne in mind that if 
different values of are used, cross section predictions change both because of the dependence of the 
cross section on the value of as (which for some processes such as top production or Higgs production 
in gluon-gluon fusion may be quite strong), and because of the dependence of the PDFs themselves on 
the value of a^. Differences due to the PDFs alone can be isolated only when performing comparisons 
at a common value of as- 

3.2 Computation of PDF+Os uncertainties 

Within the quadratic approximation to the dependence of on parameters (i.e. linear error propagation), 
it turns out that even if PDF uncertainty and the as{mz) uncertainty are correlated, the total one-sigma 
combined PDF+a^ uncertainty including this correlation can be simply found without approximation 

'There is implicitly an additional uncertainty due to scale variation.See for example Ref. |26|. 

''We note that the values used in the average are from extractions at different orders in the perturbative expansion. 
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by computing the one sigma PDF uncertainty with fixed at its central value and the one-sigma ag 
uncertainty with the PDFs fixed their central value, and adding results in quadrature [20], and similarly 
for any other desired confidence level. 

For example, if l^XpoF is the PDF uncertainty for a cross section X and AXq,^(^^) is the ag 
uncertainty, the combined uncertainty AX is 

= ^^XpDF + ^Xl^:m,) (8) 

Other treatments can be used when deviations from the quadratic approximation are possible. 
Indeed,for MSTW because of the use of dynamical tolerance linear error propagation does not necessarily 
apply. For NNPDF, because of the use of a Monte Carlo method linear error propagation is not assumed: 
in practice, addition in quadrature turns out to be a very good approximation, but an exact treatment is 
computationally simpler. We now describe in detail the procedure for the computation of ag and PDF 
uncertainties (and for HERAPDF also of model and parametrization uncertainties) for various parton 
sets. 

3.21 CTEQ - Combined PDF and Ug uncertainties 

CTEQ takes a°(mz) = 0.118 as an external input parameter and provides the CTEQ6.6alphas ||20l (or 
the CTlOalpha Q) series which contains 4 sets extracted using ag{mz) = 0.116, 0.117, 0.119, 0.120; 
The uncertainty associated with ag can be evaluated by computing any given observable with ag = 
0.118 lb S'^^^^ in the partonic cross-section and with the PDF sets that have been extracted with these 
values of ag. The differences 

A^- = ^(aO + 5(68) c,^) _ ^(aO)^ ^ _^(^o _ ^(68)^^) _ _^(^0) 

are the Ug uncertainties according to CTEQ. In 1201 it has been demonstrated that, in the Hessian 
approach, the combination in quadrature of PDF and asuncertainties is correct within the quadratic 
approximation. In the studies in Ref. 120], CTEQ did not find appreciable deviations from the quadratic 
approximation, and thus the procedure described below will be accurate for the cross sections considered 
here. 

Therefore, for CTEQ6.6 the combined PDE+a^uncertainty is given by 

A^^^+- = y(A-)' + ((AF^L)+)' (10) 
A^^^+- = y(A-)V((AF^L)-)' 

3.22 MSTW - Combined PDF and Ug uncertainties 

MSTW fits ag together with the PDFs and obtains a°(7VLO) = 0.1202t°;°°}^ and a^^{NNLO) = 
0.1171 lb 0.0014. Any correlation between the PDF and the uncertainties is taken into account with 
the following recipe [23 1. Beside the best-fit sets of PDFs, which correspond to a^{NLO, NNLO), 
four more sets,both at NLO and at NNLO, of PDFs are provided. The latter aie extracted setting as input 
tts = lb 0.5(Tq^ , a^ ib Uo^, where is the standard deviation indicated here above. Each of these 
extra sets contains the full parametrization to describe the PDF uncertainty. Comparing the results of the 
five sets, the combined PDE-t-Q^uncertainty is defined as: 

^PDF+a. ^ max{F-^(5o) + (AF^b^)+}-F"°(5o) (11) 
^PDF+a. = F°°(5o)-min{F"=(So)-(AF^|,^)_} 
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where max, min run over the five values of under study, and the corresponding PDF uncertainties are 
used. 

The central and = ± O.Sctq,^ , it a^s, where cJq^ sets are all obtained using the dynamical 
tolerance prescription for PDF uncertainty which determines the uncertainty when the quality of the fit 
to any one data set (relative to the best fit for the preferred value of as{Mz)) becomes sufficiently poor. 
Naively one might expect that the PDF uncertainty for the it might then be zero since one is by 
definition already at the limit of allowed fit quality for one data set. If this were the case the procedure 
of adding PDF and as uncertainties would be a very good approximation. However, in practice there is 
freedom to move the PDFs in particular directions without the data set at its limit of fit quality becoming 
worse fit, and some variations can be quite large before any data set becomes sufficiently badly fit for 
the criterion for uncertainty to be met. This can led to significantly larger PDF +as uncertainties than 
the simple quadratic prescription. In particular, since there is a tendency for the best fit to have a too 
low value of dF2/dhiQ'^ at low x, at higher value the small-x gluon has freedom to increase without 
spoiling the fit, and the PDF +as uncertainty is large in the upwards direction for Higgs production. 

3.23 HERAPDF - as, model and parametrization uncertainties 

HERAPDF provides not only uncertainties, but also model and parametrization uncertainties. Note 
that at least in part parametrization uncertainty will be accounted for by other groups by the use of a 
significantly larger number of initial parameters, the use of a large tolerance (CTEQ, MSTW) or by a 
more general parametrization (NNPDF), as discussed in Sect. 12.131 However, model uncertainties related 
to heavy quark masses are not determined by other groups. 

The model errors come from variation of the choices of: charm mass (m-c = 1.35 — ?■ 1.65GeV); 
beauty mass (m;, = 4.3 — )• 5.0 GeV); minimum 

Q2 of data used in the fit (Q^,„ = 2.5 5.0 GeV^); 
fraction of strange sea in total d-type sea (/^ = 0.23 — )• 0.38 at the starting scale). The model errors are 
calculated by taking the difference between the central fit and the model variation and adding them in 
quadrature, separately for positive and negative deviations. (For HERAPDFl.O the model variations are 
provided as members 1 to 8 of the LHAPDF file: HERAPDFIO VAR.LHgrid). 

The parametrization errors come from: variation of the starting scale = 1.5 — )• 2.5 GeV^; 
variations of the basic 10 parameter fit to 11 parameter fits in which an extra parameter is allowed to 
be free for each fitted parton distribution. In practice only three of these extra parameter variations 
have significantly different PDF shapes from the central fit. The parametrization errors are calculated by 
storing the difference between the parametrization variant and the central fit and constructing an envelope 
representing the maximal deviation at each x value. (For HERAPDFl.O the parametrization variations 
are provided as members 9 to 13 of the LHAPDF file: HERAPDFIO VAR.LHgrid). 

HERAPDF also provide an estimate of the additional error due to the uncertainty on as{Mz)- 
Fits are made with the central value, as{Mz) = 0.1176, varied by ±0.002. The 90% c.l. as error on 
any variable should be calculated by adding in quadrature the difference between its value as calculated 
using the central fit and its value using these two alternative as values; 68% c.l. values may be obtained 
by scaling the result down by 1.645. (For HERAPDFl.O these ag variations are provided as members 
9,10,11 of the LHAPDF file: HERAPDFIO ALPHAS.LHgrid for a, (M^) = 0.1156,0.1176,0.1196, 
respectively). Additionally members 1 to 8 provide PDFs for values of as{Mz) ranging from 0.114 to 
0.122). The total PDF + as uncertainty for HERAPDF should be constructed by adding in quadrature 
experimental, model, parametrization and as uncertainties. 

3.24 NNPDF - Combined PDF and as uncertainties 

For the NNPDF2.0 family, PDF sets obtained with values of as {mz ) in the range from 0. 1 14 to 0. 124 in 

steps of Aas = 0.001 are available in LHAPDF. Each of these sets is denoted by NNPDF20_as_01 14_100.LHgrid, 
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NNPDF20_as_0115_100.LHgrid, ... and has the same structure as the central NNPDF20_100.LHgrid set: 
PDF set number is the average PDF set, as discussed above 



= i^a.) = J^E ■ (12) 

^^I-CP k=l 

for the different values of ag, while sets from 1 to 100 are the 100 PDF replicas corresponding to this 
particular value of a^. Note that in general not only the PDF central values but also the PDF uncertainties 
will depend on Ug. 

The methodology used within the NNPDF approach to combine PDF and as uncertainties is dis- 
cussed in Ref. |[T9ll28l . One possibility is to add in quadrature the PDF and uncertainties, using PDFs 
obtained from different values of a^, which as discussed above is correct in the quadratic approximation. 
However use of the exact correlated Monte Carlo formula turns out to be actually simpler, as we now 
show. 

If the sum in quadrature is adopted, for a generic cross section which depends on the PDFs and 
the strong coupling a (PDF, a^), we have 

(Ht =^(PDFW,aW±5„,) -a(PDFW,a(0)) , (13) 

where PDF^^'^) stands schematically for the PDFs obtained when is varied within its 1-sigma range, 
a^s^ zb ■ The PDF+a^ uncertainty is 



iS^)pDF+a. = \l " + ('J^)PDF " • (14) 

with {Sa)pY)p the PDF uncertainty on the observable a computed from the set with the central value of 
as- 

The exact Monte Carlo expression instead is found noting that the average over Monte Carlo 
replicas of a general quantity which depends on both as and the PDFs, J-" (PDF, as) is 

(-^U = 1^E E PDF('^^'^),«0) , (15) 

where PDF^'^'^'-^^ stands for the replica kj of the PDF fit obtained using ai''^ as the value of the strong 
coupling; A'rep is the total number of PDF replicas 

^'^^ U) 

iVrep = E^rep ' d^) 

i=i 

and N"^^^ is the number of PDF replicas for each value ai"'^ of a^. If we assume that as is gaussianly 
distributed about its central value with width equal to the stated uncertainty, the number of replicas for 
each different value of is 



2 



± 



ATf^ oc exp 1^-^ ^ I . (17) 

with a^^^ and 6^^ the assumed central value and 1-sigma uncertainty of as(m^). Clearly with a Monte 
Carlo method a different probability distribution of ag values could also be assumed. For example, if 
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we assume a^lM^) = 0.119±0.0012 and we take nine distinct values a^i' = 0.115, 0.116, 0.117, 0.118, 

0.119,0.120,0.123, assuming 100 replicas for the central value (as = 0.119) we get 7V°j^' = 0,4,25,71, 
100,71,25,4,0. 

The combined PDF+Os uncertainty is then simply found by using Eq. Q with averages com- 
puted using Eq. ([T5] ). The difference between Eq. (031 ) and Eq. (fT4l) measures deviations from linear 
error propagation. The NNPDF benchmark results presented below are obtained using Eq. ([Ts] ) with 
Ois {"mz) = 0.1190 lb 0.0012 at one sigma. No significant deviations from linear error propagation were 
observed. 

It is interesting to observe that the same method can be used to determine the combined uncertainty 
of PDFs and other physical parameters, such as heavy quark masses. 



4. PDF correlations 

The uncertainty analysis may be extended to define a correlation between the uncertainties of two vari- 
ables, say X{a) and Y{d). As for the case of PDFs, the physical concept of PDF correlations can be 
determined both from PDF determinations based on the Hessian approach and on the Monte Carlo ap- 
proach. 



4.1 PDF correlations in the Hessian approach 

Consider the projection of the tolerance hypersphere onto a circle of radius 1 in the plane of the gradients 
VX and VY in the parton parameter space lfT3ll24l . The circle maps onto an ellipse in the XY plane. 
This "tolerance ellipse" is described by Lissajous-style parametric equations, 

X = Xo + AXcos6', (18) 
Y = ^0 + Aycos(e + (^), (19) 

where the parameter 9 varies between and 2tt, Xq = X(ao), and Yq = y(ao). AX and Ay are the 
maximal variations 5X = X — Xq and 5Y = Y — Yq evaluated according to the Master Equation, and 
(/? is the angle between VX and VY in the {aj} space, with 



VX -VY 1 



TV 



1=1 

The quantity cos ip characterizes whether the PDF degrees of freedom of X and Y are correlated 
(cos « 1), anti-correlated (cos (p ^ —1), or uncorrelated (cos p 0). If units for X and Y are rescaled 
so that AX = Ay (e.g., AX = Ay = 1), the semimajor axis of the tolerance ellipse is directed at 
an angle 7r/4 (or Sir/ 4) with respect to the AX axis for cos (/? > (or cos p < 0). In these units, the 
ellipse reduces to a line for cosy? = ±1 and becomes a circle for cosp = 0, as illustrated by Fig. [3] 
These properties can be found by diagonalizing the equation for the correlation ellipse. Its semiminor 
and semimajor axes (normalized to AX = Ay) are 

simp 
cos p 



The eccentricity e = y 1 — {aminm- / dmajorY is therefore approximately equal to a/|cos p\ as |cos p\ 
1. 



f SXy { 5YY f 6X\ f 6Y\ 2 
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Fig. 3: Correlations ellipses for a strong correlation (left), no correlation (center) and a strong anti-correlation(right) |4). 



A magnitude of | cos tp\ close to unity suggests that a precise measurement of X (constraining 5X 
to be along the dashed line in Fig. [3]) is likely to constrain tangibly the uncertainty 5Y in Y, as the value 
of Y shall lie within the needle-shaped error ellipse. Conversely, cos 99 implies that the measurement 
of X is not likely to constrain 5Y stronglyj^] 

The values of AX, /XY, and cos 93 are also sufficient to estimate the PDF uncertainty of any 
function /(X, Y) of X and Y by relating the gradient of /(X, Y) todxf = Of /dX and dyf^df /dY 
via the chain rule: 



A/= V/ 



(AX dxf f + 2AX Ay cos^dxf dyf + (AF dyff. 



(23) 



Of particular interest is the case of a rational function /(X, Y) = X™/y", pertinent to computations 
of various cross section ratios, cross section asymmetries, and statistical significance for finding signal 
events over background processes 1241 . For rational functions Eq. (1231 ) takes the form 




AX Ay / Ay 

2mn—r: tt— cos ip + ( n- 



Xn Yn 



Yn 



(24) 



For example, consider a simple ratio, / = X/Y. Then A/ //q is suppressed (A// /o ~ |AX/Xo — Ay/yd) 
if X and Y are strongly correlated, and it is enhanced (A///0 ~ AX/Xq + AY/Yq) if X and Y are 
strongly anticorrelated. 

As would be true for any estimate provided by the Hessian method, the correlation angle is in- 
herently approximate. Eq. ( |20l ) is derived under a number of simplifying assumptions, notably in the 
quadratic approximation for the function within the tolerance hypersphere, and by using a symmet- 
ric finite-difference formula for {diX} that may fail if X is not monotonic. With these limitations in 
mind, we find the correlation angle to be a convenient measure of interdependence between quantities 
of diverse nature, such as physical cross sections and parton distributions themselves. For example, in 
Section |5. 22 [ the correlations for the benchmark cross sections are given with respect to that for Z pro- 
duction. As expected, the and W~ cross sections are very correlated with that for the Z, while the 
Higgs cross sections are uncorrelated ('mHiggs=l20 GeV) or anti-correlated {mHiggs=2A0 GeV). Thus, 
the PDF uncertainty for the ratio of the cross section for a 240 GeV Higgs boson to that of the cross 
section for Z boson production is larger than the PDF uncertainty for Higgs boson production by itself. 

A simple C code (corr.C) is available from the PDF4LHC website that calculates the correlation 
cosine between any two observables given two text files that present the cross sections for each observable 
as a function of the error PDFs. 



'The allowed range of 5Y / ISY for a given 5 
Vl — 5^ sin if. 



SX/AX is 



.(-) 



< SY/AY < 



.(+) 



r Y J where 



= 5 cos tp ± 
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4.2 PDF correlations in the Monte Carlo approach 

General correlations between PDFs and physical observables can be computed within the Monte Carlo 
approach used by NNPDF using standard textbook methods. To illustrate this point, let us compute the 
the correlation coefficient i?] for two observables A and B which depend on PDFs (or are PDFs 
themselves). This correlation coefficient in the Monte Carlo approach is given by 

where the averages are taken over ensemble of the N^cp values of the observables computed with the 
different replicas in the NNPDF2.0 set, and aA,B are the standard deviations of the ensembles. The 
quantity p characterizes whether two observables (or PDFs) are correlated (p ^ 1), anti-correlated (p 
— 1) or uncorrected (p 0). 

This correlation can be generalized to other cases, for example to compute the correlation between 
PDFs and the value of the strong coupling as{mz), as studied in Ref. [19. 28,1. for any given values of 
X and Q^. For example, the correlation between the strong coupUng and the gluon at x and (or in 
general any other PDF) is defined as the usual correlation between two probability distributions, namely 



f,,2\ ( nA] ^rep {^s{Ml)g{x,Q^))^^^-{as{Ml))^^^{g{x,Q^)\ 

as yMz] ,g[x,Q )\ 



(26) 

where averages over replicas include PDF sets with varying ag in the sense of Eq. (fTSl ). Note that the 
computation of this correlation takes into account not only the central gluons of the fits with different ag 
but also the corresponding uncertainties in each case. 



17 



5. The PDF4LHC benchmarks 

A benchmarking exercise was carried out to which all PDF groups were invited to participate. This 
exercise considered only the-then most up to date published versions/most commonly used of NLO 
PDFs from 6 groups: ABKM09 S, S, CTEQ6.6 |4|, GJR08 |7|, HERAPDFl.O LSJ, MSTW08 f9\, 
NNPDF2.0 ifTOl . The benchmark cross sections were evaluated at NLO at both 7 and 14 TeV. We report 
here primarily on the 7 TeV results. 

All of the benchmark processes were to be calculated with the following settings: 

1. at NLO in the MS scheme 

2. all calculation done in a the 5-flavor quark ZM-VFNS scheme, though each group uses a different 
treatment of heavy quarks 

3. at a center-of-mass energy of 7 TeV 

4. for the central value predictions, and for ±68% and ±90% c.l. PDF uncertainties 

5. with and without the uncertainties, with the prescription for combining the PDF and Us errors 
to be specified 

6. repeating the calculation with a central value of as{mz) of 0.119. 

To provide some standardization, a gzipped version of MCFM5.7 |[25]| was prepared by John 
Campbell, using the specified parameters and exact input files for each process. It was allowable for 
other codes to be used, but they had to be checked against the MCFM output values. 

The processes included in the benchmarking exercise are given below. 

1. W~^, and Z cross sections and rapidity distributions including the cross section ratios /W^ 
and {W~^+W~)/Z and the W asymmetry as a function of rapidity {[W^ {y) — W~ {y)]/[W~^ {y) + 
W-{y)]). 

The following specifications were made for the W and Z cross sections: 

(a) 7712=91.188 GeV 

(b) mvi/=80.398 GeV 

(c) zero width approximation used 

(d) GF=0.n6631 XlO-^GeV-^ 

(e) sin'^ew = 0.2227 

(f) other EW couplings derived using tree level relations 

(g) BR{Z II) = 0.03366 

(h) BR(1^ ^ lu) = 0.mO 

(i) CKM mixing parameters from Eq. 1 1 .27 of the PDG2009 CKM review 
(j) scales: fiR = = mz or mw 

2. gg — )• Higgs total cross sections at NLO in the Standard Model 
The following specifications were made for the Higgs cross section. 

(a) mH = 120, 180 and 240 GeV 

(b) zero Higgs width approximation, no branching ratios taken into account 

(c) top loop only, with mtop = 171.3 GeV in do 

(d) scales: fj,ji = fj,p = ni Higgs 

3. tt cross section at NLO 

(a) mtop = 171.3 GeV 

(b) zero top width approximation, no branching ratios 

(c) scales: fiR = fip = i^top 
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The cross sections chosen are all important cross sections at the LHC, for standard model bench- 
marking for the case of the W, Z and top cross sections and discovery potential for the case of the Higgs 
cross sections. Both qq and gg initial states are involved. The NLO W and Z cross sections have a small 
dependence on the value of as{mz), while the dependence is sizeable for both ti and Higgs production. 



5.1 Comparison between benchmark predictions 

Now we turn to compare the results of the various PDF sets for the LHC observables with the common 
benchmark settings discussed above. To perform a more meaningful comparison, it is useful to first 
introduce the idea of differential parton-parton luminosities. Such luminosities, when multiplied by the 
dimensionless cross section sa for a given process, provide a useful estimate of the size of an event cross 
section at the LHC. Below we define the differential parton-parton luminosity dLij/ds: 

S = ;r^[/'(-i-f')/.N.'') + (i«2)i. (27) 

The prefactor with the Kronecker delta avoids double-counting in case the partons are identical. The 
generic parton-model formula 



(28) 



can then be written as 



(29) 



Relative quark-antiquark and gluon-gluon PDF luminosities are shown in Figures|4]and[5] CTEQ6.6, 
NNPDF2.0, HERAPDFl.O, MSTW08, ABKM09 and GJR08 PDF luminosities are shown, all normal- 
ized to the MSTW08 central value, along with their 68 %c.l. error bands. The inner uncertainty bands 
(dashed lines)for HERAPDFl.O correspond to the (asymmetric) experimental errors, while the outer un- 
certainty bands (shaded regions) also includes the model and parameterisation errors. It is interesting 
to note that the error bands for each of the PDF luminosities are of similar size. The predictions of 
W/Z, ti and Higgs cross sections are in reasonable agreement for CTEQ, MSTW and NNPDF, while 
the agreement with ABKM, HERAPDF and GJR is somewhat worse. (Note however that these plots do 
not illustrate the effect that the different as{mz) values used by different groups will have on (mainly) 
ti and Higgs cross sections.) It is also notable that the PDF luminosities tend to differ at low x and 
high x, for both qq and gg luminosities. The CTEQ6.6 distributions, for example, may be larger at low 
X than MSTW2008, due to the positive-definite parameterization of the gluon distribution; the MSTW 
gluon starts off negative at low x and and this results in an impact for both the gluon and sea quark 
distributions at larger values. The NNPDF2.0 qq luminosity tends to be somewhat lower, in the Z 
region for example. Part of this effect might come from the use of a ZM heavy quark scheme, although 
other differences might be relevant. 

After having performed the comparison between PDF luminosities, we turn to the comparison of 
LHC observables. Perhaps the most useful manner to perform this comparison is to show the cross- 
sections as a function of as, with an interpolating curve connecting different values of for the same 
group, when available |[27l (see Figs. [6l|9l). Following the interpolating curve, it is possible to compare 
cross sections at the same value of a^. The predictions for the CTEQ, MSTW and NNPDF W and Z 
cross sections at 7 TeV (Figs. [6l|71) agree well, with the NNPDF predictions somewhat lower, consistent 
with the behaviour of the luminosity observed in Fig. |4] The cross sections from HERAPDFl.O and 
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4: The qq luminosity functions and their uncertainties at 7 TeV, normalized to the MSTW08 result. Plot by G. Watt 127 i . 
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Fig. 6: Cross section predictions at 7 TeV for and Z production. All Z cross sections plotted here use a value of sin'^ 9w 



0.23149. Plot by G. Watt (27)- 
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Fig. 7: Cross section predictions at 7 TeV for the W/Z and /W production. All Z cross sections plotted here use a value 



of sin 



= 0.23149. Plot by G. Watt [27]. 



ABKM09 are somewhat larger |^ The impact from the variation of the value of as is relatively small. 
Basically, all of the PDFs predict similar values for the W/Z cross section ratio; much of the remaining 
uncertainty in this ratio is related to uncertainties in the strange quark distribution. This will serve as 
a useful benchmark at the LHC. A larger variation in predictions can be observed for the /W~ 
ratio (see Fig. IT]). This quantity depends on the separation of the quarks into flavours and the separation 
between quarks and antiquarks. The data providing this information only extends down to x = 0.01, and 
consists partially of neutrino DIS off nuclear targets. Hence, different groups provide different results 
because they fit different choices of data, make different assumptions about nuclear corrections and make 
different assumptions about the parametric forms of nonsinglet quarks relevant for x < 0.01. 

The predictions for Higgs production from gg fusion (Figs. [8]|9l) depend strongly on the value of 
as', the anticorrelation between the gluon distribution and the value of is not sufficient to offset the 
growth of the cross section (which starts at 0{al) and undergoes a large O(a^) correction). The CTEQ, 



Updated versions of these plots, including an extension to NNLO, will be presented in a forthcoming MSTW publication. 
See also Ref. Uj. 
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Fig. 8: Cross section predictions at 7 TeV for a Higgs boson (gg fusion) for a Higgs mass of 120 GeV (left) and 180 GeV(right). 
Plot by G. Watt (27). 



MSTW and NNPDF predictions are in moderate agreement but CTEQ lies somewhat lower, to some 
extent due to the lower choice of as(M|). Compared at the common value of as(M|) = 0.119, the 
CTEQ prediction and that of either MSTW or NNPDF, have one-sigma PDF uncertainties which just 
about overlap for each value of mn- If the comparison is made at the respective reference values of 
Os, but without accounting for the Os uncertainty, the discrepancies are rather worse, and indeed, even 
allowing for Ug uncertainty, the bands do not overlap. Hence, both the difference in PDFs and in the 
dependence of the cross section on the value of are responsible for the differences observed. A useful 
measure of this is to note that the difference in the central values of the MSTW and CTEQ predictions 
for a common value of as{M^) = 0.119 for a 120 GeV Higgs (a typical discrepancy) is equivalent to a 
change in as{M'^) of about 0.0025. The worst PDF discrepancy is similar to a change of about 0.004. 
The predictions from HERAPDF are rather lower, reflecting the behaviour of the gluon luminosity of 
Fig. [51 The ABKM and GJR predictions are also rather lower, but the ag dependence of results is not 
explicitly available for these groups, hence it is hard to tell how much of the discrepancy is due to the 
fact that these groups adopt low values of Og. 

Production of a tt pair (Fig. |9l right plot) probes the gluon-gluon luminosity at a higher value of 
V§, with smaller higher order corrections than present for Higgs production through gg fusion. The cross 
section predictions from CTEQ6.6, MSTW2008 and NNPDF2.0 are all seen to be in good agreement, 
especially when evaluated at the common value of as{mz) of 0.119. 

5.2 Tables of results from each PDF set 

In the subsections below, we provide tables of the benchmark cross sections from the PDF groups par- 
ticipating in the benchmark exercise. Only results for 7 TeV will be provided for this interim version of 
the note. 

5. 21 ABMK09 NLO 5 Flavours 

In the following sub-section, the tables of relevant cross sections for the ABKM09 PDFs are given. 
Results are given for the value of as{mz) determined from the fit. The charm mass is taken to be 
1.5 lb 0.25 GeV and the bottom mass is taken to be 4.5 ± 0.5 GeV. The heavy quark mass uncertainites 
are incorporated in with the PDF uncertainties. 

The results obtained with the ABKM09 NLO 5 flavours set are reported in Tables 1-2. 
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Fig. 9: Cross section predictions at 7 TeV for a Higgs boson of mass 240 GeV (left) and for tt production (riglit). Plot by G. 
Watt (27). 



Process 


Cross section 


combined PDF and errors 


aw+ *BR{W+ l+:^)[nb] 


6.3398 


0.0981 


aw- * BR{W- l-v)[nh] 


4.2540 


0.0657 


az" *BR{Z° l+l-)[nb] 


0.9834 


0.0151 




139.55 


7.96 


agg^mggsil^Q GeV)[ph] 


11.663 


0.314 


CTsg^mgg.(180 GeV)[ph] 


4.718 


0.147 


CJgg^mggsi'^^Q GeV)[ph] 


2.481 


0.092 



Table 1 . Benchmark cross section predictions and uncertainties for ABKM09 NLO = 5 for , Z, tt 
and Higgs production (120, 180, 240 GeV) at 7 TeV. The central prediction is given in column 2. Errors 
are quoted at the 68% CL. The PDF and as{mz) errors are evaluated simultaneously. Higgs boson cross 
sections are corrected for finite top mass effects (1.06, 1.15 and 1.31 for masses of 120, 180 and 240 
GeV respectively. 
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Table 2. Benchmark cross section predictions (da/dy * BR in nb) for ABKM09 NLO with Uf = 5 for 
, Z° production at 7 TeV, as a function of boson rapidity. 

5.22 CTEQ6.6 

In the following sub-section, the tables of relevant cross sections for the CTEQ6.6 PDFs are given (Tables 
3-6). The predictions for the central value of as{mz) are given in bold. Errors are quoted at the 68% c.l. 
For CTEQ6.6, this involves dividing the normal 90%c.l. errors by a factor of 1.645. 



as{mz) 


aw+ * BR{W+ /+i/)[nb] 


aw- *BR{W- l-i^)[nb] 


az" *BR{Z'' l+l-)[nb] 


0.116 


5.957 
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0.9331 
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5.993 


4.068 
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0.118 


6.057 


4.106 


0.9469 
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6.064 
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0.120 
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4.139 


0.9539 



Table 3: Benchmark cross section predictions for CTEQ6.6 for W^, Z and tt production at 7 TeV, as 
a function of agimz)- The results for the central value of as{mz) for CTEQ6.6 (0.118) are shown in 
bold. 
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Table 4: Benchmark cross section predictions for CTEQ6.6 for gg — Higgs production (masses of 120, 
180 and 240 GeV), and for production, at 7 TeV, as a function of as{mz)- The results for the central 
value of as{mz) for CTEQ6.6 (0.118) are shown in bold. Higgs production ross sections have been 
corrected for the finite top mass effect (a factor of 1.06 for 120 GeV, 1.15 for 180 GeV and 1.31 for 240 
GeV). 
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<jgg^Hi33s(2AQ GeV)\pb] 


2.610 


+0.054/-0.058 
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0.078 


-0.73 



Table 5: Benchmark cross section predictions and uncertainties for CTEQ6.6 for W^, Z,tt and Higgs 
production (120, 180, 240 GeV) at 7 TeV. The central prediction is given in column 2. Errors are quoted 
at the 68% c.l.. Both the symmetric and asymmetric forms for the PDF errors are given. In the next-to- 
last column, the (symmetric) form of the PDF and as{mz) errors are added in quadrature. In the last 
column, the correlation cosine with respect to Z production is given. 
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Table 6: Benchmark cross section predictions (da/dy * BR in nb) for CTEQ6.6 for W^, Z° production 
at 7 TeV, as a function of boson rapidity. 

5.23 GJR 

In the following sub-section, the tables of relevant cross sections for the GJR08 PDFs are given (Tables 
7-8). The results are given at the fit value of as{mz) and the errors correspond to the PDF-only errors at 
68% c.l. 



Process 


Cross section 


PDF Error 


aw+*BR{W+ l+iy)[nb] 


5.74 


0.11 
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0.09 



Table 7: Benchmark cross section predictions and uncertainties for GJR08 for W^, Z,tt md 
Higgs production (120, 180, 240 GeV) at 7 TeV. The results are given at the fit value of as{mz) and the 
errors correspond to the PDF-only errors at 68% c.l. Higgs boson cross sections have been corrected for 
the finite top mass effect (a factor of 1.06 for 120 GeV, 1.15 for 180 GeV and 1.31 for 240 GeV). 
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Table 8: Benchmark cross section predictions {da /dy * BR in nb) for GJR for W , Z° production at 
7 TeV, as a function of boson rapidity. The results are given at the fit value of as{mz) and the errors 
correspond to the PDF errors at 68% CL. 

5.24 HERAPDFl.O 

In the following sub-section, the tables of relevant cross sections for the HERAPDFl.O PDFs are given 
(Tables 9-10). The predictions for the central value of as{mz) are given in bold. 
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Table 9: Benchmark cross section predictions and uncertainties for HERAPDFl.O for W^,Z,ti and 
Higgs production (120, 180, 240 GeV) at 7 TeV. The central prediction is given in column 2. Errors 
are quoted at the 68% c.l. The column labeled Exp. errors stands for the symmetric experimental 
uncertainty, Model error — for the asymmetric model uncertainty, Param. error — for the asymmetric 
parameterization uncertainty and as error for the uncertainty due to the Aas = 0.002 variation. The 
Total error stands for the total uncertainty calculated by adding the negative and positive variations in 
quadrature. Higgs boson cross sections have been corrected for the finite top mass effect (a factor of 1 .06 
for 120 GeV, 1.15 for 180 GeV and 1.31 for 240 GeV). 
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001 
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Table 10: Benchmark cross section predictions (da/dy x BR in nb) for HERAPDFl.O set calculated at 
a5 = 0.11 76 as a function of boson rapidity. All sources of error calculated above are included. 



5.25 MSTW2008 

In the following sub-section, the tables of relevant cross sections for the MSTW2008 PDFs are given 
(Tables 1 1-14). The predictions for the central value of as{mz) are given in bold. 



asimz) 


aw+ * BR{W+ l+u)[nb] 


aw- * BR{W- l-v)[nb] 


azo * BR{Z" ^ l+l-)[nb] 


0.1187 


5.897 


4.150 


0.9336 


0.1194 


5.927 


4.171 


0.9398 


0.1202 


5.957 


4.190 


0.9442 


0.1208 


5.982 


4.208 


0.9479 


0.1214 


6.008 


4.225 


0.9516 


0.1190 


5.911 


4.160 


0.9374 



Table 1 1 : Benchmark cross section predictions for MSTW 2008 for , Z and ti production at 7 TeV, 
as a function of as{mz). The results for the central value of as{mz) for MSTW 2008 (0.1202) are 
shown in bold. 





f^yy^Il lyg.s (120 Gc V) [pb] 


(^yy^lUyy^-^ii-SO GcV) [pb] 


Cryy^IUgysii-^O Gc'V) [pb] 


f^Ll[pl>] 


0.1187 


12.13 


5.08 


2.74 


163.5 


0.1194 


12.27 


5.14 


2.77 


165.8 


0.1202 


12.41 


5.19 


2.81 


168.1 


0.1208 


12.53 


5.24 


2.83 


170.0 


0.1214 


12.64 


5.29 


2.86 


171.9 


0.1190 


12.18 


5.10 


2.76 


164.4 



Table 12: Benchmark cross section predictions for MSTW 2008 for gg — > Higgs production (masses 
of 120, 180 and 240 GeV), and for tt production, at 7 TeV, as a function of as{mz)- The results for the 
central value of asimz) for MSTW 2008 (0.1202) are shown in bold. Cross sections have been corrected 
for the finite top mass effect (a factor of 1.06 for 120 GeV, 1.15 for 180 GeV and 1.31 for 240 GeV). 
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(Tgg^mggs{24:0 GeV)\pb] 


2.806 


+0.057/-0.069 


0.062 


+0.052/-0.063 


+0.101/-0.077 



Table 13: Benchmark cross section predictions and uncertainties for MSTW 2008 for W^,Z,ti and 
Higgs production (120, 180, 240 GeV) at 7 TeV. The central prediction is given in column 2. Errors 
are quoted at the 68% c.l. The symmetric and asymmetric forms for the PDF errors are given. The ag 
uncertainty is the deviation of the central prediction at the upper and lower 68% c.l. values of as(M|). 
The combination follows the procedure outlined in the text. 
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Table 14: Benchmark cross section predictions (da/dy * BR in nb) for MSTW 2008 for W^, Z° pro- 
duction at 7 TeV, as a function of boson rapidity. 



5.26 NNPDF2.0 

We show results for the NNPDF2.0 PDF set, with the default NNPDF choice of as{mz) = 0.119, as 
well as for other values of {mz) in Table 15 for all the benchmark LHC observables. Note that for each 
value of as we provide the central prediction and the associated PDF uncertainties. For the combined 
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PDF+a^ uncertainty, we assume the benchmark value of Og {mz) = 0.1190 it 0.0012 as a 1-sigma 
uncertainty. 

Next in Table 16 we provide for the same observables the benchmark cross section predictions and 
uncertainties for the NNPDF2.0 set with the combined PDF and strong coupling uncertainties at LHC 7 
TeV. In the last column, the PDF and Ug errors are combined using exact error propagation as discussed 
in Sect. [3^ 

Finally in Table 17 we provide the benchmark cross section predictions for the differential rapidity 
distributions (dfi/dy-BR in pb) for NNPDF2.0 for and production at 7 TeV, as a function of the 
boson rapidity. We provide for each bin in rapidity the combined PDF and uncertainty using exact 
error propagation. 





g{W+)^t{W+ l+ui) 


(j{W-)^v{W- l+vi) 


as=0.115 
as=Q.m 
as=0.119 
as=0.121 
a,=0.123 


5.65 ± 0.13 nb 
5.73 ± 0.13 nb 
5.80 ± 0.15 nb 
5.87 ± 0.13 nb 
5.98 ± 0.14 nb 


3.86 ± 0.09 nb 
3.91 ± 0.08 nb 
3.97 ± 0.09 nb 
4.03 ± 0.08 nb 
4.10 ± 0.10 nb 





fT(ZU)Br(Z+ ^ /+/-) 


cr(tf) 


a,=0.115 
a,=0.117 
as=0.119 

a,=0.121 
a,=0.123 


886 ± 18 pb 
898 ± 16 pb 
909 ± 19 pb 
921 ± 17 pb 
937 ± 21 pb 


156 ± 5 pb 
162 ± 5 pb 
169 ± 6 pb 
176 ± 6 pb 
182 ± 7 pb 





cr(iJ)(120 GeV) 


a[H){l%Q GeV) 


a{H){2m GeV) 


as=0.115 
a,=0.117 
a,=0.119 

as=0.121 
as=0.123 


11.61 ± 0.25 pb 
11.90 ± 0.19 pb 
12.30 ± 0.18 pb 
12.66 ± 0.18 pb 
12.92 ± 0.20 pb 


4.86 ± 0.12 pb 
5.05 ± 0.09 pb 
5.22 ± 0.10 pb 
5.38 ± 0.09 pb 
5.49 ± 0.10 pb 


2.69 ± 0.066 pb 
2.75 ± 0.066 pb 
2.84 ± 0.066 pb 
2.93 ± 0.066 pb 
3.00 ± 0.079 pb 



Table 15: Benchmark cross section predictions for NNPDF2.0 for W^, Z^, ti and Higgs production at 
7 TeV, as a function of as{mz)- The results for the central value of as{mz) for NNPDF2.0 (0.119) 
are shown in bold. For each value of as{mz) we provide both the central prediction and the associated 
1-sigma PDF uncertainties. The Higgs boson cross sections have been corrected for the finite top mass 
effect (1.06,1.15,1.31) for the three values of the Higgs boson mass. 
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0.14 
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2.84 


0.066 


0.052 


0.092 



Table 16: Benchmark cross section predictions and uncertainties for NNPDF2.' for W , Z , tt and 
Higgs production (120, 180, 240 GeV) at LHC 7 TeV. The central prediction is given in column 2. 
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We provide PDF uncertainties as 1-sigma uncertainties. In the last column, the PDF and errors are 
combined using exact error propagation as discussed in Sect. 13.241 The Higgs boson cross sections have 
been corrected for the finite top mass effect (1.06,1.15,1.31) for the three values of the Higgs boson mass. 
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Table 17: Benchmark cross section predictions (dfi/dy-BR in nb) for NNPDF2.0 for W and pro- 
duction at 7 TeV, as a function of boson rapidity. We provide for each bin in rapidity the combined PDF 
and as uncertainty using exact error propagation. 

5.3 Comparison of , W~ , Z° rapidity distributions 

NLO predictions for the , W~ and Z cross sections (along with the PDF uncertainties) are plotted 
as a function of the boson's rapidity are plotted in Figure [TO] using the CTEQ6.6 PDFs. 

In the figures below, a comparison is made for the predictions of the , W~ , Z^ boson rapidity 
distributions for the PDFs discussed in this note. In general, the predictions are in reasonable agreement, 
but differences can be observed that should be detectable with a reasonably-sized data sample at 7 TeV. 

6. Summary 

In this interim report, we have tried to provide a snapshot of our current understanding of PDFs and the 
associated experimental and theoretical uncertainties, and of predictions for benchmark cross sections 
at the LHC (7 TeV) and their corresponding uncertainties. This snapshot will be updated as new input 
data/theoretical treatments become available. Many of the PDFs discussed in this note are now not the 
most recent generation from the respective PDF groups, but we have concentrated on these since they 
are in the most common use by the experimental groups. 
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Fig. 10: The W , W and Z cross sections at 7 TeV (along with the PDF uncertainties) as a function of rapidity for the 
CTEQ6.6 PDFs. The error bars indicate the PDF+cii, uncertainties shown in Table 6. 




Fig. 1 1 : The W cross section at 7 TeV as a function of rapidity for the PDFs discussed in this note, normalized to the result of 
CTEQ6.6. The error bars indicate the PDF(+as/model/...) uncertainties as defined in the tables provided by each PDF group. 
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Fig. 12: The W cross section at 7 TeV as a function of rapidity for the PDFs discussed in this note, normalized to the result 
of MSTW2008. The error bars indicate the PDF(+Qfs/model/...) uncertainties as defined in the tables provided by each PDF 
group. 



■ CTEQ6.6 
ABKMD9 




Fig. 13: The Z° cross section at 7 TeV as a function of rapidity for the PDFs discussed in this note, normalized to the result 
of MSTW2008. The error bars indicate the PDF(+Qfa/model/...) uncertainties as defined in the tables provided by each PDF 
group. 
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